Exploration server for computer science research in Lorraine

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Comparative Behaviour of Recent Incremental and Non-incremental Clustering Methods on Text: An Extended Study

Internal identifier: 002736 (Main/Exploration); previous: 002735; next: 002737

Authors: Jean-Charles Lamirel [France]; Raghvendra Mall [India]; Mumtaz Ahmad [France]

RBID : ISTEX:6A5356072F6EB6B6173A6EC0874B6D12CC0BF1A3

Abstract

This paper attempts to shed some light on the qualities and the defects of some recent clustering methods, whether incremental or not, on “real world data”. An extended evaluation of the methods is carried out using textual datasets of increasing complexity. The third test dataset is a highly polythematic dataset that provides a static simulation of evolving data. It thus represents an interesting benchmark for comparing the behaviour of incremental and non-incremental methods. The focus is on neural clustering methods, but the standard K-means method is included as a reference in the comparison. Generic quality measures are used for quality evaluation.

DOI: 10.1007/978-3-642-21822-4_3



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Comparative Behaviour of Recent Incremental and Non-incremental Clustering Methods on Text: An Extended Study</title>
<author>
<name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
<affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
</author>
<author>
<name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
</author>
<author>
<name sortKey="Ahmad, Mumtaz" sort="Ahmad, Mumtaz" uniqKey="Ahmad M" first="Mumtaz" last="Ahmad">Mumtaz Ahmad</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:6A5356072F6EB6B6173A6EC0874B6D12CC0BF1A3</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1007/978-3-642-21822-4_3</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-JT6TSTW2-X/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001880</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">001880</idno>
<idno type="wicri:Area/Istex/Curation">001861</idno>
<idno type="wicri:Area/Istex/Checkpoint">000636</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000636</idno>
<idno type="wicri:doubleKey">0302-9743:2011:Lamirel J:comparative:behaviour:of</idno>
<idno type="wicri:Area/Main/Merge">002778</idno>
<idno type="wicri:Area/Main/Curation">002736</idno>
<idno type="wicri:Area/Main/Exploration">002736</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Comparative Behaviour of Recent Incremental and Non-incremental Clustering Methods on Text: An Extended Study</title>
<author>
<name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, Campus Scientifique, BP 239, Vandoeuvre-lés-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
</author>
<author>
<name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Inde</country>
<wicri:regionArea>Center of Data Engineering, IIIT Hyderabad, NBH-61, Hyderabad, Andhra Pradesh</wicri:regionArea>
<wicri:noRegion>Andhra Pradesh</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Inde</country>
</affiliation>
</author>
<author>
<name sortKey="Ahmad, Mumtaz" sort="Ahmad, Mumtaz" uniqKey="Ahmad M" first="Mumtaz" last="Ahmad">Mumtaz Ahmad</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, Campus Scientifique, BP 239, Vandoeuvre-lés-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper attempts to shed some light on the qualities and the defects of some recent clustering methods, whether incremental or not, on “real world data”. An extended evaluation of the methods is carried out using textual datasets of increasing complexity. The third test dataset is a highly polythematic dataset that provides a static simulation of evolving data. It thus represents an interesting benchmark for comparing the behaviour of incremental and non-incremental methods. The focus is on neural clustering methods, but the standard K-means method is included as a reference in the comparison. Generic quality measures are used for quality evaluation.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Inde</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement>
<li>Nancy</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
<orgName>
<li>Centre national de la recherche scientifique</li>
<li>Laboratoire lorrain de recherche en informatique et ses applications</li>
<li>Synalp (Loria)</li>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Grand Est">
<name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
</region>
<name sortKey="Ahmad, Mumtaz" sort="Ahmad, Mumtaz" uniqKey="Ahmad M" first="Mumtaz" last="Ahmad">Mumtaz Ahmad</name>
<name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
</country>
<country name="Inde">
<noRegion>
<name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
</noRegion>
<name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
</country>
</tree>
</affiliations>
</record>

To work with this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002736 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002736 | SxmlIndent | more
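
Once a record has been extracted with one of the commands above, its fields can be read with any XML library. The following is a minimal Python sketch, not part of Dilib, assuming the record text has the structure shown on this page. The RECORD constant is an abridged copy of that record, and the helper names (parse_record, summarize) are hypothetical. Note that the record uses the wicri: prefix without declaring it, so a strict parser rejects it as is; the sketch binds the prefix to a placeholder URI, which is an assumption and not an official namespace.

```python
import xml.etree.ElementTree as ET

# Abridged copy of the record shown on this page. In practice this text
# would come from the saved output of the HfdSelect command.
RECORD = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Comparative Behaviour of Recent Incremental and Non-incremental Clustering Methods on Text: An Extended Study</title>
<author>
<name sortKey="Lamirel, Jean Charles" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
</author>
<author>
<name sortKey="Mall, Raghvendra" first="Raghvendra" last="Mall">Raghvendra Mall</name>
</author>
<author>
<name sortKey="Ahmad, Mumtaz" first="Mumtaz" last="Ahmad">Mumtaz Ahmad</name>
</author>
</titleStmt>
<publicationStmt>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1007/978-3-642-21822-4_3</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""

# Placeholder namespace URI (an assumption): the record never declares
# the "wicri" prefix, so one is bound here only to make the text parseable.
WICRI_NS = "urn:example:wicri"


def parse_record(text):
    """Parse a record after binding the undeclared wicri prefix on the root."""
    patched = text.replace("<record>", '<record xmlns:wicri="%s">' % WICRI_NS, 1)
    return ET.fromstring(patched)


def summarize(root):
    """Return the title, author names, year, and DOI from the TEI header."""
    header = root.find("./TEI/teiHeader/fileDesc")
    title = header.findtext("./titleStmt/title")
    authors = [name.text for name in header.findall("./titleStmt/author/name")]
    year = header.find("./publicationStmt/date").get("when")
    doi = header.findtext("./publicationStmt/idno[@type='doi']")
    return {"title": title, "authors": authors, "year": year, "doi": doi}


summary = summarize(parse_record(RECORD))
print(summary["authors"], summary["year"], summary["doi"])
```

The same summarize function should work on the full record, since it only relies on the titleStmt and publicationStmt elements of the header.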

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:6A5356072F6EB6B6173A6EC0874B6D12CC0BF1A3
   |texte=   Comparative Behaviour of Recent Incremental and Non-incremental Clustering Methods on Text: An Extended Study
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022